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This  paper  describes  several  parallel  algorithms  that  solve  geometric  problems, 
The  algorithms  are  based  on  a  vector  model  of  computation  -  the  scan-model. 

The  purpose  of  this  paper  is  both  to  show  how  the  model  can  be  used  and  to  sho\ 

a  set  of  interesting  algorithms.  ’ 

-We-describe  a  k-D  tree  algorithm  that,  for  n  points,  requires  0(lg  n) 
calls  to  the  primitives,  a  closest-pair  algorithm  that  requires  0(lg  n)  calls 
to  the  primitives,  a  line-drawing  algorithm  that  requires  0(1)  calls  to  the 
1700  a  Unp-nf-siehr  algorithm  that  requires  0(1)  calls  to  the 
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primitives,  and  finally  three  different  convex-hull  algorithms.  All  these 
algorithms  should  be  noted  for  their  simplicity  rather  than  complexity; 
many  of  them  are  parallel  versions  of  known  serial  algorithms. 

Most  of  the  algorithms  discussed  in  this  paper  have  been  implemented  on 
the  Connection  Machine,  a  highly  parallel  single  instruction  multiple  data  (SIMD) 
computer . 
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Abst  fact 

L  his  [lapef  descrihes  sevcaal  paitillel  alf^uiitliiiis  lliat  sol\'c  geoiiieiric  problems.  1  he 
algorillims  arc  btiscd  on  a  vector  model  of  computation  -  th('  scan-nwdcl.  TIk;  purpose 
(/!'  this  [lapcr  i.s  Ixjt  li  to  show  liow  llic  ijiode)  can  be  used  ttiul  to  show  a  set  oi  iiilerostiug 
;d,i;orit  lims, 

W'e  describe  a  ti('e  algorithm  th.al.  for  »  points,  re([uir('.s  O(lgtt)  calls  to  tlie  prim¬ 
itives,  ii  closest-p.iir  algorithm  that  rcHiuires  (){lgn)  calls  to  the  primitives,  a  line-drawing 
algorithm  that  rcHpiires  0(1)  calls  to  the  i)rimitives.  a  line-of-sight  algorithm  that  requires 
0(1)  calls  to  tlie  i)ritiiitives.  and  (imdiy  three  dilferent  convex-litdl  algorithms.  All  these 
algorithms  should  be  noted  for  their  .-.imprn  ily  ratln'r  than  comple.xily;  many  of  them  are 
p.iralh'l  versions  of  known  sc'rial  algorithms. 

.Mo-it  of  t  he  algoi'it  hms  <liscu.^sed  in  t  his  pa|)er  h.ive  Inum  imphmumted  on  t  he  Connection 
.Machine.  ;i  highly  pa  rallel  .^illgh'  inst  ruci  ion  midt  i  ph'  dat  a  ( .SI  .\1 1)  )  computei'. 

.'VckiiouleclgeiiicMits:  I  his  report  (h^■'Cri))e.-,  le'.f'.arch  done  within  the  .Artificial  hit(d- 

ligeiice  l,,iboral(jr\’  at  the  .M  as.',a(  h  usel  t  s  lii',tiiiite  of  Technology.  .Support  lor  t  lu'  .A.l. 
L.i  iHJiatoiv's  art  ilici.al  intelligence  reseaiadi  is  jnovifh'd  in  j)art  by  the  .Advanced  Research 
I’loji'cis  .Agency  ol  the  Department  of  Defense  under  .Army  cont  ract  D,A( '.A7(j-S.a  COOK) 
and  in  part  undei’  the  Ollice  of  Naval  Research  contract  .N’OOOl  l  S.a  1\  0121. 

'c  .\  I  a -sac  h  Use!  t  s  1  list  it  n  I  ('  ol  1 1'ldmology.  lO.ss 
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1  Introduction 


'I'lio  |)uiin)><'  oi'  this  jiaptT  iwotulil.  l  iislly.  it  dcscrilx's  a  sot  ol' ('logaiit.  practical  alj^o- 
ritliiu^  lur  solviiiy;  a  divorso  sot  of  ])rul)loiiis  in  coiiiptitatioiial  “ooiiiotry  and  ‘:,rai)liics.  Soc- 
oncll)'.  it  liolps  doiuon>t into  tliat  tlio  .s<(in-inuil(  /is  a  viahlo  inodol  ofcoiiipulation.  I'lioso  two 
purposes  coiniiloniont  oach  other:  the  model  allows  a  sim|)lo  description  of  the  alaorilhms. 
.  a.nd  the  algorithms  demonstrate  the  pt)wer  of  the  model. 

Hesearchei's  hav('  suggested  sevei'.d  .sy nc h ionous  |)aralhd  models  of  computation.  1  he 
most  ixipufa'r  of  these  models  are  the  i)arallel  landom  access  machine  ( l’-R.\.M  )  models  [l-l]. 
■  ■  A  P-R'.\^i  consists  of  a  set  of  conventional  {)rocessors  attached  to  a  single  shared  memoiy. 

Pjjoco.ssors  coinmunlcato  through  the  shared  memory:  one  processor  can  write  a  value  into 
^  iiiemoijj^  Hiid  Jiiiother  ])rocossor  can  read  this  value.  Researchers  have  suggestc'd  several 
^variations  of  the  P-R.\.M  models.  These  variations  mostly  differ  in  whether  or  not  they 
permit  concurrent  reads  from,  or  concurrent  writes  to.  a  uni<|ue  memory  location.  Hy 
jassumlig  that  memory  references  take  uiiit-liiiic.  the  I*-R.\.\I  models  havi'  Ix'en  used  to 
determine  the  asymptotic  ruiuiing  time  of  many  parallel  algorithms. 

W'e  suggest  a  not  her  class  of  synchronous  i)arallel  models  of  compiitat  ion  defined  in  terms 
of  a  set  of  i)ritnitive  operations  that  work  on  arbitrarily  long  vectors  of  simi)le  values.  W'e 
call  these  models,  vector  moch'ls.  The  models  differ  from  P-R;VM  modt'ls  both  in  that 
they  are  single  instruction  multiple  data  (Sl.MD)  models,  and  in  that  there  is  no  concept 
of  a  memory  shared  among  many  processors.  FJemetits  in  a  vector  communicate  through  a 
permutation  primitive  rather  than  a  shared  memory.  As  with  tlie  P-R.'VM  models,  vector 
models  can  Ire  used  to  analyze  the  asymptotic  running  time  of  algorithms,  by  assuming 
th;u  a  set  of  primitives  take  unit-lime. 

Since  vector  models  are  SIMh).  they  can  be  oflicienlly  mapped  onto  a  widi'r  range 
of  ai(  hitect  ui'i's  lh;in  l’-R.\M  models  can.  .As  well  as  being  imirlementa  ble  on  standard 
sei'ial  ('(Jiniiuters  and  on  multiph'  instruction  parallel  com[)Ulers.  they  can  he  efiiciently 
i  in  pigment  I'd  on  \e<tor  procrs.-.oi's,  ^uch  a>  tin'  \<-ctor  |>roci'ssor  oi  the  ('R.A\  si.^tems  [2  ij. 
or  .'i  nyh'  i  n.i  I  rue  i  ion  p.i  lal  lei  com  p  u  t  ers.  .^uch  a.-'  t  he  ( 'on  nect  ion  M  ac  hine  [  1  (i].  On  t  he  ot  her 
hand,  'iiue  l’-R.\.M  model-  .m'  mult  iple  in.'.t  i  mt  ion  multiple  data  (.\11MI))  modeh,  they 
aie  nioie  poweifiil  than  vectoi  lUodeN.  .\s  should  become  evident  in  this  paper,  ami  as 
sl.ovui  el-ex'.-hei e  [ll)J.  iliis  addiiion.il  power  i.--  not  necessary  lor  a  broad  I'ange  of  ])r<ictical 
a  Igoi  i  I  h  1 1 1." .  We  .d-,o  iielji'M'  that  \'e(|oi  modi'h  tend  to  le;ul  to  simpler  ;unl  more  coiicreie 
algoiiihm  ihsi  liptioiis  ihan  do  l’-R.\.M  models. 

I  l.e  .-.((I II - iiiiiil(  I  is  a  p.iiiii  ular  M-i  toi'  model  in  which  thri’e  classes  ol  \'ectoi'  opi'iio 
tion-  .lie  (  oiisiden'il  iiiiil-liiiii  |)riiiiit  i\es:  element  wise  aril  h  met  ic  and  logii.il  operations: 


.v.S. 


ponnulatiuii  o|)('iatiuiih:  and  scan  oijoratioiis.  a  type  of  prefix  coniirutation.  L5y  unil-tinn 
p/iinitires  \ve  iiiean  that  they  reriuire  apjrroximately  an  equivalent  duration  of  time  wlmii 
executed  on  equal  length  vectors. 

In  this  i)aper  we  describe  several  algorithms  based  on  the  scan-model.  The  hist  is  an 
algoiithm  that  constructs  a  /c-D  tree.  A.  /c-D  tree  is  a  technique  for  splitting  ti  points  in  a  k 
dimensional  space  into  n  regions  each  with  a  single  iioint.  The  A'-D  tree  technitiue  is  used 
as  a  siihslep  in  a  laigi'  nninlnT  of  ;i|)plication.s  ranging  from  rendering  images  to  machine 
learning  [20].  l  or  n  points,  the  algorithm  wedescribe  takes  0{k\gn)  calls  to  the  ])rimitive.s 
u.-^iiig  vectors  of  length  0{ii).  'I'his  algorithm  is  optimal  in  the  sense  that  even  if  simulated 
on  ,t  seiial  m.H'hiiie.  it  will  nin  in  the  s.ame  asymptotic  running  time  as  the  best  serial 
algoi  it  hm. 

liasi'd  on  the  A’-D  tree  .ligorithm,  we  describe  a  two  dimensional  closcst-pair  algorithm. 
In  the  two  dimensiontil  clo.se.st  pair  problem  we  want  to  find  the  jiair  of  [loiiits  in  a  phine 
that  aic  close.'t  to  each  other  (Euclidean  distance).  Thi.s  iilgorithin  is  a  parallel  veision 
ol  an  tilgoiithm  of  lieiitley  and  .Shamos  [9].  l-'or  n  points  in  a  two  dimensional  sptice.  our 
algoiithm  re(|uir(’s  tA(lgn)  ctdls  to  the  primitives  using  vectors  of  length  0{n). 

1  he  ihitd  algoritlim  is  a  line  drawing  routine.  Line  drawing  is  the  problem  of:  given 
a  pail  of  points  on  a  two  dimensional  grid  (the  two  endpoints  of  a  line),  determine  what 
pi.\e|>  ill  a  lin/ie  resolution  grid  lie  on  a  line  between  tlte  endpoints.  This  routiite  requires 
(>\  1  I  (alls  to  th"  primiti\-es  using  vectors  no  longer  than  the  number  of  iroints  in  the  line. 
1  he  roiitine  has  been  e\tend(>d  to  reinb'i'  solid  (jbj('cts  [25]. 

1  he  rt/iiiih  algoiithm  is  a  line  of  sight  algorithm.  CJiven  a  grid  of  altitudes  and  tin 
olwe)  \  ,1 1  ion  poi  lit  oil  t  he  grid,  the  algorit  hm  n't  urns  the  ])oints  visible  from  the  observation 
point,  .\  line  of  sight  .ilgoiii  lim  can  bt' aj)|)lied  to  hel])  determine  wher('  to  locate  j)Otenlial 
e>e,-oi(‘s,  l  or  example,  when  designing  a  building,  a  highway  or  ;i  city  dump,  it  is  often 
intoi mat  i\e  to  know  litim  where  tin'  ■'eyescnu,'"  will  be  visible. 

W’e  li  iia  lly  d  I 'SI  I  i  be  I  h  re('  pin  na  r  con  vex- hull  algorit  hnis.  Cli  ven  //  points  in  the  plane,  t  he 
pl.iiiai  I  onM'x  iiidl  |iioblem  tiiids  which  of  thes('  points  lie  on  the  ])erimeter  of  tin'  smalh'st 
i  oii\,  V  |.  .j1mii  i!i;iI  (  iMil.i ill.',  all  |)oinls.  'I  Wo  id  the  couvex-liull  algorit  hms  we  describe  are 
-iiiiple  .iiiil  ,iie  likely  to  perform  very  well  in  practice,  but  they  are  not  pn.ivably  optimal 
(.Main  seis  of  laielidlv  .-.eh'cled  point--  will  pi'i  form  badly.  The  third  algorithm  is  more 
I  .  Miipln  ateil  and  |,|ob,ibly  h's-,  |)iactical.  but  is  theoretically  optimal.  This  algorithm  is 
b.i'.  i!  on  .1  p.ir.dh'l  .ilgoiiihm  designed  lor  the  concurrent  read  exclusive  write'  (CHEW  ) 
I’  K.A.M  model  [1,11. 

.\lo-.l  ol  t  h  e  ,d._;(  ,|  i  I  h  Ills  wedesciibe  in  llil'  |)a|ier  htive  bei’ll  imidemented  on  the  ('on- 
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iic'c  lioii  Macliiuo.  1  lie  code  wo  sliow  m  Iho  loxi  witli  some  syntactic  clianf>;o.s  is  aclual  code 
iisi'd  to  oxociito  the  algoritluiis. 

riio  remainder  of  this  paiier  is  organized  as  follows: 

•  V\’e  define  the  scan-model  in  terms  of  the  primitive  oi>erations  it  snpijorts. 

•  W’e  introduce  some  powerful  techniriues  based  on  the  scan-model.  'I'liese  tecliniijues 
are  useii  extensively  in  the  descri[)tion  of  algorithms. 

•  W’e  descrilie  the  algorithms. 


2  The  Scan-Model 

The  scan-model  is  defined  in  terms  of  a  set  of  primitive  ojierations  that  operate  on  arbitrarily 
long  vectors  of  atomic  values,  lly  a  vector  we  mean  a  one  dimen.sional  array  (an  ordered 
set).  Hy  atomic  values  W(>  iman  valiH's  that  can  be  represented  in  (.tllgn)  bits  in  this 
])apc‘r  we  oidy  use  int('gei>.  floaling  point  numbers  and  booh'an  values,  W’e  assmm,'  that  all 
l)rimitiv(>s  reciuire  apinoximately  an  erpiivalent  duration  of  time  when  operating  on  ecpial 
length  vectors.  W'e  call  this  tiiiu'  ''unit  tiim'".  'I'o  determine  the  tictind  running  time  of 
an  algorithm  on  a  /jarticiilar  mai  him',  wc'  need  to  know  both  the  numbi'j-  id' ctills  to  the 
primitives  and  the  length  of  the  vectors  itsc'd'. 

The  scan-model  h;is  threi'  classes  of  unit-time  primitive's:  elementwisi'  arithmetic  and 
logical  operations,  pei'mutation  operations,  and  scan  operations,  a  tyjx'  of  jiartillel  jiiofix 
crmipul  iition. 

Eleiiiciitvvi.se  Priinitiv  es 

I'iach  elementwise  pi'imilivc  operates  on  etpi.d  length  vectors,  pioducing  a  result  vector  ol 
e(|nal  h'liglh.  1  he  element  i  ol  tie'  result  is  .in  ('lemeiil a rv  aiitlinielic  oi  logii  al  piimilive 
such  as  f.— .r.  or  and  not  applied  to  eh'inent  i  ol  I'acli  ol  the  iiijint  ecctois.  for 
example: 

'  lit'  Vf't'.i  I-,  I  Ml  [ini  1 .1  It  I  f\in  lilt  iii.trliiiK's  siiio-  (or  siillniciil  l\  loiit;  \iiltas.  iniillipU- 

iK  111'  III-  iiiii-l  I"  .ill,M.ili,l  I,.  '.I'll  pi'll  I  >.-nr  .111(1  r.ii  |i  piiHc.-Mir  imisl  limp  ovn  llnsr  i  liiiiciits  ivlnii 
'■M  ,  II  line  .III  .  iprr.i  I  lull 
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A  +  n  =  [7  G  G  12  1  12  S  s] 

A  X  1!  rr  [10  .')  !)  21  3  27  12  12] 

Ill  lulditioii  to  till'  staiulaitl  elciiK'iilary  ojicratioiis,  wo  iiicIiKlo  an  ojx'ratoi-  select  that 
takes  one  boolean  ai>;,iiiiieiit  and  two  oilier  ii  rji.iiiiieiit.s.  llasetl  on  the  boolean  ai<;iuiienl . 
the  select  fuiietion  will  retiini  either  the  first  or  second  of  tlie  other  two  argiinieiils. 


A 

B 

1' 

select! F.  A.  B) 


[T  F 


1 

3  8 
F  F 

3  8 


3  9 

1  3 

T  T 

3  9 


2  6] 
G  2] 
F  T] 

G  6] 


Perntutatioii  Primitives 

I'he  jx.i  uialulion  [iriiiiilive  ttikes  two  vector  iir.yiiineiits  --  a  dala  ctcfor  tuicl  an  index  vector 
tiiid  periiiutes  each  eleiiM'iil  in  (he  dala  vector  to  the  lociition  specified  in  the  indi'x 
Vector.  For  e.xtuiipie; 


\’ec;lor  Index 

.V  (dala  \  I'ct  or ) 

1  t  index  Vec  tor) 


permute!  A,  I ) 


[0  1  2  3  4  5  G  7] 

[o  I  e  III  e  r  g  y] 

[2  ••)  4  3  1  G  0  7] 


e  o  III  e  1  r  y] 


It  is  an  error  lor  iiioii’  ihan  oin'  eh'inent  to  have'  the  saiiK'  iiulex  — -  the  permutation 
must  be  one-to-one.  this  re.-.t  riel  ion  is  similar  lo  the  re.si  riction  made  in  the  ('xcliisive  read 
ex<liisi\e  write  |l  lil.W  )  I’-K.A.M  model,  in  whiih  it  i.s  ;\ii  ('nor  to  write  mon'  than  one 
v.i.'iie  loa  paila  nlai  nieniois  hualion  al  a  lime. 

lo  allow  Com  III  II  liica  t  ion  belweeli  \ecl,,m  ordill'erelll  .si/('S.  W(‘  include  a  VI'Isioli  of 
lie  permute  priiniiive  ihal  leliirns  a  viihii  ol' d iHerenl  |eii'u,th  than  the. source  '-eclors. 

1  his  veiioii  lake-,  two  extra  .1 1  gii  men  I  s:  .i  di  jmill  lirlor.  wliich  specifii's  the  h'ligth  of  the 
(lest  i  na  1  ion  vector  and  put'  di'laidi  values  in  positions  that  do  not  receive  any  value:  and 
a  ■^(hili'iii  riclcr.  which  masks  onl  (i  ilaiii  I'leiiieiits  so  tlmt  tln'V  do  not  |K'lliiiile.  I'or 
e:-.a  III  p|e: 


•,  V  ■ 


=  [0  12  3  1  5  6  7] 

=  [o  t.  o  III  ('  r  y  ;;] 

=  [r  r  e  e] 

=  [TFFTFFFF] 
[2  ')  4  3  1  (i  7  0] 


Vector  Index 

A  (data  vector) 

D  (default  vector) 

S  (selection  vector) 
I  (index  vector) 


permute(A,  I,  S,  D)  =  [f  r  o  iii] 

Scan  Primitives 

d  lic  scan  iiriiiiitive.s  execute  a  scan  operation,  sonietiines  called  a  prefix  coiiijuitatioii. 
on  a  \eclor.  ilie  scan  operation  takes  a  binary  associative  ojierator  s£-.  and  a  vector 
[uy.  (ii , of  /i  elements,  ami  returns  the  vector  [ay.  (oo'®  («ofr  «i  m ...  A  u,i_] )]. 

In  this  [laper  wi'  will  only  Use  plus,  maximum,  minimum,  or  and  and  as  operators  for  tlie 
scan  priiniti ve.s.  W’e  v^ill  heucefortli  call  these  scan  operations  -F-scan,  max-scan,  min-scan, 
or-scan  and  and-scan.  Some  examples; 


+  -scan(A)  =  [r>  G  9  13  IG  2.3  27  3.3] 

max-scan(.\)  =  [5  b  b  5  .3  9  9  9] 

Some  readers  might  be  skeptical  about  considering  the  scan  operations  as  "unit-time'’ 
primiiives.  Our  justi (icat ion  is  straightforward.  On  a  serial  machine,  it  is  clear  that  the 

scan  operation.-,  using  simple  operarors  siicli  as  -f  will  be  just  as  fast  as  the  ct  her  primitives: 

all  the  primitives  will  taki'  0(»)  tiiii(>  on  vc'ctors  of  length  ».  On  a  parallel  machine  it  is 
not  hard  to  show,  boih  in  theory  and  in  pr.iitice.  that  a  circuit  that  executes  the  sctin 
opein.lions  can  be  built  with  less  hardware  and  will  rnii  just  as  last,  or  faster,  than  a  circuit 
that  execiiie-  a  ri’.id  or  write  into  ,i  sliaix'd  iiiemorv  (such  a  read  or  write  c;in  be  used  to 
imideuieiit  the  permut.atioii  piimilivei.  t  his  is  argued  in  more  deta.il  in  [11],  Ailmitteilly, 
both  the  svaii  and  a  shared  memory  reterence  take  at  hxist  Ig  n  real  lime,  but  we  are  only 
arguing  here  th.it  the  priimtires  take  approxitiiately  the  same  amount  of  time  on  equal 
length  vectois. 

In  the  description  o|  algorithms  we  will  olten  loosely  relei  to  vectors  in  which  eticli 
eh'iuei;  t  1  ( 111  t  a  i  II  mole  than  one  a  t  om  ic  v.a  lue,  lor  exa  mple.  we  will  use  \'ec  to:  s  of  poi  n  t  s  in 
a  t  wo  d  I  me  Us  in  11  .i!  sp.ice;  i  ,u  h  jioiiit  has  I  w'l )  values,  nil  .r  .i  nd  y  (  ooi  dinal so  the  lector 

[i3.fi)  (1.  3)  (!),  7)] 


rei)i('M'utb  tlie  tlirco  poiiitb  (:j.  (J).  (-1,  b)  and  (9,  7).  At  tlie  primitive  sucli  a  structure 
v('rtur  would  be  imi)lemeiiled  with  two  vectors  but  a  higher  level  language  could  support 
record-like  vcctcjis  in  which  each  eleiiient  luts  some  constant  miinber  of  values. 

2.1  Segiiieiits 

1  hi.s  section  tlescribes  a  method  tluit  allows  a  prograiniiier  to  take  a  vector  routine^  defined 
to  wiu  k  on  ;i  si),i;le  set  of  data  ;iiid  then  api)ly  it  to  many  sets  in  parallel.  For  example,  il 
\S('  htid  ;i  vector  routine  ihtit  sorted  a  set  of  valties.  we  cotild  apply  it  to  sort  many  sets  ol 
(la.ttL  in  partillel.  Or.  if  we  had  a  vector  routine  that,  given  endpoitits,  detertnines  the  pixels 
on  ;i  line,  we  could  apply  it  to  rlraw  immy  lines  in  |>ariillel. 

I  he  techni(|ue  invohx's  dividing  ;i  vector  into  segments  and  jilacing  oite  set  of  data  in 
e;tch  .segment,  lo  kei'p  trai  k  (jI  ln>w  a  d;it;i  vector  is  segmented,  we  associate  with  the 
data  ve<!tora  .^(  ijiiit  iit-dt  •''('i  t jiliff,  .A  ///-d<  .''Cr/y^/o/' is  itsidf  ;i  v  ecttu'wliK  h  has  <is  man\ 

(  'eiiients  as  segment  .  of  the  dat.i  vecior;  each  of  these  eleimuits  conttiiiis  ;in  iiilc'vei  whit  h 
s  I  lec  i  Ill's  the  lellgl  ll  o|  I  lie  segiii.'nr  '.  i'  ol'  eXa  III  pie: 

,1'  =  hf  1  d  1  d  9  2  ti] 

>ujini  n  I -(l<  >1  ri  jiliir  —  [2  I  2] 

.1  =  id  li  [d  4  3  9]  [2  b] 

lleucelorth.  the  nutation 

.1  =  l.d  1]  [d  4  d  9]  [2  G] 

i.s  shorthand  lor  :i  pair  ol  victors:  the  data  Vector  along  with  its  .'•(  ijiik  nt-il<  ■••ri'ipttji'. 

for  eath  jiiimitive  ol  the  scan-inodel  we  deline  ;t  segmented  vt'ision  that  works  indt'- 
peiideiiily  within  'Mill  segment,  ligiiie  I  shows  examples  of  segmented  versions  ol  the 
pi  j  Illiri\ es,  r|ii‘  se.^llicllledi  Ve|si,i|i  of  the  pe|  m  lit  .1  t  ion  plilllitit'e  liases  its  indices  lelativt' 
I  o  I  he  ilegill  llilig  1  if  e.n  ll  'e'4l||e|;  |  ..,1  )  .lilies  j).'|  lillll  e  il  hill  .1  'eg  men  I  il  is  ail  e|  I  ol'  |ol 

.III  index  ll)  ii'ieo me  oni'idi’  of  ihe  'egiiieiii.  1  he  '|■gm"lll'■d  xei'inn  III  llie  scans  primi 
I  i  \  ( ■'  I  I  ''  I  .  I  r  1  .1  I  I  In'  hei_;i  ll  n  I  I,  ol  e,|C  h  -eg  me|i  I  ' .  I  he  o  gmi'li  I  ei  ]  \  el  si  oil  (it  t  lu'  cl  e  1 !  1  e  11 1  v\  i  SI 
opi  l  a  I  iolis  a  re  11  in'  lia  11  get! . 

■  A  \  I  I  1 1  ll  n  'III  lie  I.'  .1  e  .III  1  III’  il*  ti  in.  1  tit  I  la  110  el  lit'  \  ii  1  ■  .i  pi  iiinl  i  \  es  we  liis.  us'i  si , 

1  lie  I  ,  ■  a  1 '  se  \  i  1.1!  el  lie  r  W.l  Vs  et  1  epi  es,.|i !  I  ll  -a  -  ■  gllie  li  I  '  [  1  t'j  tail  We  Jl  III  i  I  !l  Is  1  ep  ri  sen  I  .1 1  ll  111  I  il  e  lllesl 
I  ell  M  11  ielil  . 

'  .\  si  III  ll  ,1  r  ep*  r,i  I  leii  w.is  sii  -a-gi  'ii  il  [.y  S<  liw.t  1 1  Ji 


A  =  [o  1]  [3  4  3  9]  [2  G] 

D  =  [1  Ol  0  3  1]  [0  1] 

+-scan(-l)  =  [o  G]  [3  7  10  19]  [2  S] 

max-scan(  .4 )  =  (")  5]  [3  1  4  9]  [2  G] 

permute(.4,  B)  =  [1  5]  [4  9  3  3]  [2  G] 

Figure  1:  Examples  of  the  segmented  versions  of  the  primitive  operations. 

All  the  segmented  versions  can  be  simulated  witli  a  small  constant  number  of  calls  to  the 
uusegmented  versions  [10],  but  they  are  so  useful  that  in  practice  they  might  be  in\ph'meuted 
directly.  We  will  henceforth  assutne  that  the  segmented  versions  of  the  pritnitives  are 
themselves  primitives. 

We  now  return  to  the  initial  claim  of  tliis  .'.ection; 

The  Segment  Lemma:  With  a  segmented  version  of  all  the  primitives  of  the  scan- 
model,  we  can  apply  any  routine  delined  in  terms  of  those  primitives  to  work  on  a  single 
set  of  data,  to  multiple  sets  of  d;ita  independently  and  in  parallel. 

Wo  won't  irrove  this  h'liima  in  this  paper,  but  it  should  be  intuiti\e;  a  proof  is  given 
in  [10].  This  lemma  allows  great  simplilication  of  the  code  needed  to  describe  parallel 
algorithms. 

3  Some  Simple  Operations 

In  rhi>  .section  we  de.^cribe  sevortil  useful,  simple  operations  that  can  be  imiilemented  with 
a  small  constant  numlx’r  o|  calls  to  the  primitive  operations  [11].  .^s  with  the  segmented 
w  i.'ioii-.  of  ill.'  piimiiives.  ilii‘.-,e  operations  ate  u.seful  I'liough  that  tlic'y  might  themselves 
be  (  oiisidei ,  i|  pri  III  i  t  i  w.s  a  11(1  be  i  ttl  pletlieilt  ed  directly. 

distribute  rahn^  h 

I  lie  distribute  ojteiatioii  takes  a  xcctorol  i'r//irr.s  and  ;i  vector  of /frir;//;s  ;ind  distributes 
each  wiltie  into  a  segment  of  length  sjrecilied  by  h  )ii/lh8.  I'or  example: 

A  -  [7  3  K] 

I.  =  ;2  1  2] 

[7  7]  [3  3  3  3]  [s  K] 

s 


•  ^ 


distribute!  .\.  I. ) 


•4. 


A  Niiiiilar  <j|)i,'ra  I  ion  \sas  lii'Nt  by  IJatclK^r  [b]  —  lie  calk’d  it  an  iircjiiilar  s|ircad- 

in-. 

index  /( iiijtli.' 

Til'’  index  o|H'ialion  takes  a  vector  of  h  iifjths,  creates  a  sep;niont  for  eacli  len^ili.  and 
i.  iiirii'i  I  lie  iiide.\  ()1  eacli  eknitent  wiihiii  (’acli  se-iiieut.  lor  example; 

1,  =  [2  I  2] 

lndexjl,,i  =  [0  l]  ;j)  1  2  2]  [0  1] 


element  raltii  iii(li<  is 

The  element  ofieration  takes  a  se-iiieiited  vector  values,  and  a  vector  of  indices  one 
element  per  se-inent.  Mach  index  i  is  used  to  extract  the  element  from  the  correspoiidin,y 
>.e;^meut  in  fiiliits.  For  example: 

A  =  [o  1]  [;{  -1  3  9]  [2  6] 

I  =  [0  2  1] 

element!  A.  1)  =  [•'>  3  9] 

''  : -reduce  vahus 

The  redact  operations  takes  a  .se-niented  vector  of  values  nnd  combines  all  the  elements 
in  each  .sc-nient  ii.-^in-  one  of  live  binary  operators:  +.  maximum,  minimum,  or  or  and.  It 
|•etullls  a  vectoi-  with  as  many  eh'iiients  as  se-menls. 

Scime  Examples: 

.\  =:  [:.  I]  [3  I  3  9]  [2  G] 

f-reduce(.\)  =  [G  19  .s] 

max-reducei  .\ )  --  [o  9  (ij 
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pack  rulaci>  Jlaijn 

Tlie  pack  operation  takes  a  scj^inenled  vector  of  valuc.-<  and  a  st'j;inpnted  Ijoolean  \''i  nn 
o(  Jla and  packs  all  the  c'lcinents  with  a  T  in  their  Oap,  into  consecutive  element-^,  deleting 
elements  with  an  F  in  llu'ir  Ihig.  For  e.\;iini)le: 

A  =  [•'j  1]  l-J  d  d  ^1]  [’d  d] 

F  =  [T  F]  :T  F  F  T]  [T  T] 

pack!  A.  1')  =  [5j  [d  !•]  1’  d] 

A  .liinilar  opertition  was  hist  suggested  hy  Hatcher  -  he  called  it  ;in  irregular  r(jinpie> 


split  culiii  >  JJilIJ--' 

1  he  split  operaticjii  takes  a  segnienled  vector  of  cn/ms  and  a  segmented  boolean  vector 
i>(  Jhuj.-.  tind  packs  ;di  the  elenunits  with  an  F  in  tlnur  Hag  to  the  bottom  of  each  segment 
iind  elenmiiis  wiili  ;i  T  in  their  Ibig  to  tlie  top  of  etich  segtiieiit.  It  also  splits  (tach  segment 
in  two  at  the  bonndtirv  between  tln'T  ;ind  F  elements.  For  exaiuitle: 

A  =  [5  i]  [:{  -1  3  9]  [2  ()] 

F  =  (T  Fj  [T  F  F  T]  [T  T1 

split(A,  F)  =  [Ij  b'l;  ll  3j  [3  9j  []  [2  (ij 

W'e  al.-d  (bdiiie  a  delete-split  opemtion  wliich  is  the  stiiiK'  ;is  split  but  d(>leles  any  empt\ 


segment . 


delete-spliti  .\ .  f  i 


[3  F  [3  1  3  9]  [2  (e 

jT  Fj  (T  F  F  Tj  [T  Tj 

[1;  15]  [I  3]  [3  9]  [2  b] 


rank-split  ninL.^ 

1  In  ■  rank -split  opi  i  at  ion  i>  'i  iiiila  r  t . .  tie-  split  opera  t  ion  except  that  the  iiinks  a  rgnmen  i 
mi)>t  be  :i  valid  '^el  ol  indie,'-  lor  the  pel  in  ii  I  .it  ion  |)iimiti\e.  A.--  well  as  splitting  the-e 
iiidi,'.'.  tie  rank-split  oiier.iiion  t  ,'n  n  i:.l  ei  -  theni  >o  t  iiey  are  \.,ilid  within  tin'  new  segmeni- 
but  iii.iintain  t  he  .-a  m,'  ordi'i'.  I  or  ex.miple: 


t  an  k- split  I  A  .  1  I 


'!  tij  ;2  I  3  (ij  jtl  1] 

jT  F'  T  F  F  Tj  'T  Tj 

hi  I)  1)  F  A  ll'  b  'll 
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l  igiiro  2:  An  example  of  parallel  Quicksort.  Each  pivot  is  picked  at  random  from  within  a 
segment. 

Ill  (Ills  c'.xamplo.  tlio  F  part  of  l lie  .second  .segment  .starts  with  the  indices  J  and  d:  tlns^.' 
lu'c  renuinbered  lu  (I  and  L  .>.0  that  they  lepresent  a  valid  index  set  for  the  new  si’Siinenls 
and  maintain  the  same  order.  J  he  rank-split  (operation  is  used  to  ujulate  irointers  when 
ju'rl'orniing  a  split  ojieralion. 

3.1  Recursiv'e  Splitting 

'J'ho  segment  ah.straction  ami  the  jirimit ive.s  wo  desci’ibed  allow  simjile  definiiions  of 
recursive  routines  that  sitirt  with  some  ^et  of  vtilues.  split  this  set  into  subsets  and  reciir- 
sivels'  solve  the  problem  on  e.acli  sub.-.et.  W'e  will  call  this  tecliuitnie  rcruraivc  .sjilitl iiifi.  .As 
an  ('.xamph'  ol  siu  h  a  techuii|ue.  lou'ider  the  following  paitillel  viusiott  of  (Juicksoit.  .A'- 
v.ilh  the  serial  algorithm,  the  tilgoiithtn  jtick^  om'  ol  the  k<'ys  as  a  pi\’ol  \';ilue.  sidits  the 
lU'Vs  into  luo  sets,  (ine  e.  iiil  uI'mI'  I  \'.dl.'ed  key-  allil  ole'  with  le.'.sej-  \', lined  key.-,  tllul  then 
I  ei  n  r-e-  oil  e.ii  1 1  sel  . 

1  iu,u  I  e  2  s lio'i'.  s  an  e.va  m p],.  ol  i  h''  p.i  1  .tilel  \ ei -  ioii.  i  he  rout  ine  pic ks  a  1  a miom  eh.im.ii i 
In  111  I  e.H'h  see  meiii  ;is  ,1  j)l  \  lU  \  al  iie  ij-inu  'he  element  op<'r;il  ion  ’ .  The  tilgoi  it  li  m  disl  ri  but 
this  piviil  Value  o\e|-  eacii  si'eiiieiii  ii.-iiig  .1  distribute  o|)eration.  and  splits  the  keys  bascil 
on  whether  ;i  key  is  gieaici’  or  ie--  than  the  |/i\'ot  using  tlie  delete-split  opiutition’’.  d'he 

I  ii-'ii  iiu  1  h.i  l  1  lure  is  a  pi  i  mil  i  ve  ep  m.  ni  wi.-e  random  opera  I  ion  which  in  each  (-1(0110111  1  akes  a  i  inl('f;er 
and  e-tiirns  a  p.-<  iido-i.oidom  iinnilicr  h--  ih.in  th.il  inlegci. 

■  lu.  the  (Iclete-split  opM.ilions  inslead  ol  lln-  split  operation  so  that  we  never  have  more  regmenl- 


I  I 
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algDiitlmi  is  now  api)li<’(I  rocuisivcly  to  the  rosiilt.  W'licii  tlio  luiinbors  within  all  soguu'iil  arc 
in  non  tlcciciisiiig  order,  we  rcliirn  ami  iiicrgi  the  split  sets.  As  with  the  serial  algorithm, 
this  algorithm  is  e.\i)ected  to  complete  in  t'.)(lg;()  ste|)s^.  In  the  .scan-model,  each  slej) 
recpiires  a  small  constant  number  of  operations. 

The  code  needed  to  implement  (luicksort  in  the  scan-motlel  is  as  follows: 

define  quicksort! keys) { 

if-auy  (shift-left(kcys)  <  keys) 

then  pivots  element! keys,  random! length! keys))) 

quicksort!delete-split!keys,  !distribute!pivots.  Iength!keys))  <  keys))); 
else  keys} 

This  general  recursive  splitting  technique  c;in  Ite  used  in  most  divide  and  comiuer  algo¬ 
rithms.  In  this  paper  w('  will  u.-,e  it  in  tin'  A’-l)  tre('  algorithm  discussed  in  Section  I,  the 
quickhull  algorithm  discussc'd  in  Sc'ction  N.l.  and  tin'  bimiry  tree  search  method  discussed 
in  .Section  9. 

3.2  Allocation 

.Another  useful  technique  is  allocation.  Many  pvc>I>\i'ms  retjtiire  th"  allocation  of  a  set  of 
elements  that  can  then  be  operated  on  in  |>arallel.  For  e.vample  considt'r  a  liiK'  driiwing 
algorithm  that  takes  as  ininit  two  endpoints.  calculat<'s  the  length  in  pixt'Is  of  the  line, 
and  then  allocates  an  element  for  eac  h  pi.xel  so  tinti  it  can  calc  iilate  the  pi.xel  positions 
in  iiarallel  !this  is  an  outline  of  the  .'ilgoriihm  we  discuss  in  Section  (i).  ,AKo  assume  tlmt 
several  lines  need  to  be  drawn  in  paralh'l. 

.Such  allocation  is  tiivial  with  tie'  operations  wc-  dehiied  in  Section  :i.  If  wi'  h.ave  an 
integer  cec  tor.  in  whic  h  c'ach  element  spec  ilie^  how  m.my  m'w  positions  it  nec'cls,  we  can 
Use  this  vectoi  dio'ctly  in  the  distribute  .iml  index  oper.iiions  to  distiibute  the  c'lements  to 
a pprcipria  t ely  - i/ed  ^eeiiienu.  S m  il  .1 1 loi  .1 1  ion  is  n-ed  in  t  he  line  d  r;i  wing  tout  ine  dc'seri bed 
in  .Sec  t ion  (j  and  in  tlie  line  (jI  sight  .dgoiithm  desc  ribeci  in  Section  7. 

t  il.l  II  <'l«  IIMlI  t  " 
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4  Building  a  A-D  Tice 


A  A'-D  t  ii'c  is  U'i'liui(iue  IV)r  splittiii'^  n  ])()iiits  in  a  I;  dinionsioiial  s])ace  into  ii  ri'p.ioiis  each 
with  a  single  ixjiiit  [s].  It  starts  hy  splitting  the  space  in  two  along  one  ot  tlie  diniensiuns 
n.'.iiig  ;i  k  —  1  d iiiiensi<jnal  plane.  It  llien  recursively  splits  each  of  the  suhspace.s  in  two. 
1  ignrr'  d  shows  an  e.xaiiiph'  ol  a  2-1)  tree.  .\i  each  stej)  the  algorithiii  innst  selcst  which 
dimension  to  split  \'.ilhin  each  siihspace;  the  Cfiteritjii  lor  selection  depends  (jn  how  the 
tree'  will  be  used.  .\  connnon  criterion  is  to  select  the  diiiietision  :dong  which  the  spretul  of 
point s  is  gre,,  t  ('st . 

'1  he  A'-l)  tree'  is  often  Used  as  a  stej)  in  other  algorithms.  3-1)  trees  ;ire  used  in  ray 
tracing  idgurithms  for  rendering  solid  objects.  In  s\ich  alg(rrithins,  objects  need  only  be 
storerl  in  the  regiotis  they  penetiate  and  rays  need  only  exatiiine  rc'gion.s  they  cro.-is.  'I'his 
cati  greatly  rednee  the  number  (.f  objt'cts  etich  ray  needs  to  extimine.  A’-l)  Irei's  are  als(j 
ns('d  in  iminy  jiro.xitnity  algorithtiis  siich  as  the  all  closest  ])airs  problem  [15]  or  the  closest 
jiair  problem,  discussed  in  next  section.  A-l)  t  r('es  hiive  tilso  be<‘ti  suggested  for  itse  in  some 
tmichine  le;ii-ning  tilgorithms  [21)]. 

'I  he  tilgorithm  we  describe  heri'  is  a  ptirallel  version  of  ti  sl.'iiidard  s('rial  algorithm  [22]. 
for  II.  points,  onr  algorithm  takes  ()(A’lgn)  ctdls  to  the  primitives  on  vectors  of  length  n. 
'I’hi.s  algorithm  is  optijjjal  in  the  sens'-  t  h;it  ev(m  if  simulated  on  ;i  serial  machine,  it  will 
run  in  tin-  same  ;is\  iupt ut ic  running  time  as  tin-  best  serial  ttlgorit htn. 

In  tiitiny  A-l)  tree  algorithms,  when  siilitting  a  space,  oik-  ixhnt  is  selected  as  the  split 
point,  tiiid  this  point  iti  idticed  in  tteither  siile  -  it  is  tised  to  divide  the  two  sides.  Iti  our 
algmlt htii.  when  splitting  ;i  sp.ice,  wr-  place  all  |)oitits  in  one  of  the  two  sides  —  we  assume 
ihe  sitlil  liiK'  lie.s  half  way  between  the  points  on  either  side  of  the  sjtlit.  For  this  reason, 
the  rdgorilhtii  might  be  more  a|)i)ropri;itely  ctdied  ;i  A-l)  sitlitting  rather  lhaii  a  A'-l)  tiee. 

Our  algorithm  cotisists  of  one  stej)  per  s|)lit.  Fiich  stej)  recpiires  C)(A')  calls  to  the 
primlti\e.s.  liefore  executing  any  steps,  the  algorithm  sorts  the  set  of  ])oiuts  accorrling 
to  each  of  the  A'  d i mensii nis.  The  sorting  can  be  ('X<'cuterl  with  the  (Quicksort  algorithm 
di'i  iis.-ed  (.iilier.  an  en  umeral<’- p.ick  .sorting  .dgorithm  di.scnssed  in  'll]-  ol'  a  version  ol 
('(ileA  Milting  .'ilgorilhm  [12],  In.Oead  of  kt'epiug  tin'  actual  values  in  suited  onler  for  each 
dimen.-,ion.  we  keep  ihe  rank  of  each  point  along  each  dimension.  The  rank  ol  ii  point  is 
ill'-  po.-iiioi:  the  [loinl  would  be  located  at  if  the  vector  were  sorted.  \\\'  call  the  vectors 
lli.it  hold  t  liese  r.inks.  niiil.-i'iihir.'  I  here  is  one  cu/iA'-i'i  c/oc  lor  each  dimension,  fi'gured 
sho\'.'  an  example  lor  a  2-1)  iiee.  the  initi.d  riiiik-n  rtiir.s,  and  the  rr-sult  of  the  first  ste|). 

At  eai  h  ,ii'p  ol  ihe  al'joiithm  the  III  Ilk-  n  will  contain  .i  segment  lor  each  snbspace. 
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x-raiik 

v-rank 


abovo-split-linc? 

rank-split  x-rank 
rank-split  y-rank 


=  [abcdefgh  ijklmno  p] 

=  [0  6  15  10  7  2  4  12  14  8  13  3  1  5  11  9] 

=  [13  7  4  3  15  6  11  0  9  8  14  1  10  2  5  12] 

[FFTTFFFTTTTFFFTT] 

=  [0  0  7  2  13  1  5]  [7  2  4  6  0  5  3  1] 

=  [6  3  7  2  5  0  4  1]  [2  1  0  5  4  7  3  6] 


Figure  3:  An  example  of  a  2-D  tree.  The  top  diagram  shows  the  final  splitting.  The  vectors 
below  are  generated  during  the  first  step  ■—  when  splitting  along  the  line  Tj.. 


ami  the  ranks  wilhiu  each  segment  will  be  tlie  correct  ranks  for  that  subspace.  It  suffices 
l(j  ilenionstrate  that  we  can  execute  a  split  along  any  dimension  and  generate  new  rank,', 
within  the  two  subspaces.  The  algorithm  is  then  correct  by  induction. 

To  sjjfit  along  a  given  dimension  tlu'  algorithm  distributes  the  cut  line  and  determines 
for  each  point  whether  it  is  above  or  below  the  line^.  7'he  algorithm  now  uses  the  rank-split 
operation  defined  in  Section  3  to  sjilit  each  lank-vcctor  based  on  whether  a  point  is  below  or 
above  the  s])lil  line.  The  rank-split  ojreration  as  defined  correctly  generates  the  rank  within 
each  subsjracc.  Each  step  therefore  re(|uires  0{k)  calls  to  the  primitives:  some  operations 
to  determine  whether  each  point  is  below  or  above  the  split,  and  k  rank-split  operations. 
Since  there  are  0[\gn)  stejis,  the  whole  algorithm  requires  0(^'lg7i)  calls  to  the  primitives. 

In  the  closest-i)air  algorithm  discussed  next  it  is  useful  to  keep  the  rank-vectors  for  all 
the  steps.  This  will  require  that  we  store  A:  Ig  u  vectors  of  length  n. 

5  Closest  Pair 

111  a  two  ditnensioiial  c  losest  pair  problem  we  want  to  find  the  pair  of  points  in  a  plane  that 
are  closest  to  each  other  (Euclidean  distance).  The  algorithm  we  describe  is  a  parallel  ver¬ 
sion  often  algorithm  described  by  Ilentley  and  Shamos  in  [9].  For  n  points,  it  reciuires  (9(lg  n) 
ciills  to  the  iirimitivos  using  vectors  of  length  0(n).  This  algorithm  requires  (9(n  ign)  mem¬ 
ory  (O(lgn)  vectors  of  length  0{n))  but  can  be  modified  to  run  with  (9(  Ig  ti  ig  Ig  ti)  calls 
to  the  primitives  using  0{n)  memory.  Atallah  and  Goodrich  have  shown  an  (9(lg n  Ig Ig  n  ) 
time  0{u)  processor  algorithm  to  solve  the  closest  pair  problem  in  the  concurrent  read 
exclusive  write  (CREW)  P-R.\.\I  model. 

Our  algorithm  consists  of  building  the  2-D  tree  as  defined  in  the  previous  section^,  and 
then  merging  rectangles  back  to  the  original  region.  Given  two  adjacent  rectangles  and 
llu'ir  closest  pairs,  a  merge  ste[)  can  determine  the  closest  pair  of  the  merged  rectangle  with 
a  constant  number  of  ctdls  to  the  primitive's.  Because  of  segments,  we  can  tnerge  many 
pairs  of  rectangles  in  parallel. 

Since  w('  have  already  defined  how  to  build  the  2-D  siilitting,  we  will  only  describe  the 
merging  phase,  l  lie  merging  uorks  on  tin'  sann'  principle  as  (.b'seribed  in  [9].  We  will  first 
ii'view  the  ])rinciph'  and  then  show  how  it  i-,  imph'Uientr'rl  on  the  scan-model.  We  will 
denote  the  separation  of  the  clo.sesl  pair  in  a  reciangle  II  by  ('/{. 

A.s  railicr,  llic  iid  iIdmI  fi^r  clnuj-siiii;  a  ciil  line  will  (U'|h*iu1  oh  llic  parliculiir  \is(*  the  k-\)  tree. 

In  alt;(jriihiu  it  docs  not  matter  ;ii  wli.jt  order  wt-  pick  tlic  dimensions  —  in  fad,  we  could  alwavs 

."j  ill  on  tlie  same  diiuen.'-ioii. 


Merge 

Line 


Figure  4;  Merging  two  rectangles  to  determine  closest  pair.  Only  12  points  can  fit  in  the 
2<^/n<n  ^  26rni,i  dashed  box  such  that  no  two  points  in  either  A  or  li  are  closer  than 

At  each  merging  stop,  we  know  the  closest  pair  within  each  of  a  pair  of  merging  loctan- 
gles  A  and  B  and  want  to  find  the  close.st  pair  in  the  rectangle  A  U  B.  I'lie  chjsest  jrair  will 
either  be  the  pair  in  .1,  the  ptiir  in  B.  or  a  pair  with  one  point  in  A  and  the  other  in  B.  In 
the  last  case,  the  two  end  points  uun-'t  each  Fu'  within  of  the  tioundary 

between  th('  two  rectangles.  W'e  will  call  this  region  /l/f'lsee  l  igiin'  1). 

If  we  look  at  a  ijoint  pin  AB',  no  more  I  han  1 1  other  points  in  AB'  can  be  less  than  f', 
away  froin  p.  f  igure  I  shows  the  tighte.^t  possible  packing.  If  we  have  the  i)oints  in  AB' 
sorted  along  the  merge  line,  each  point  can  (b'termine  the  minimum  distance  to  another 
(joint  in  AB'  l)y  looking  at  a  fi.xed  number  of  neighbors  in  the  sorted  order  (at  most  1  1  ). 
Once  all  points  in  .  \B'  have  found  their  closest  neighbor  in  AB'.  we  tiikr'  I  lie  minimum  of 
these  distances  to  fiiul  ‘i>'d  then  calculate  the  desired  result; 

W’e  now  show  how  this  techuic|ue  is  aijplied  in  the  scan-model.  I  he  merge  consists  ol 
the  following  stejjs  (each  recinires  ;i  constant  number  of  calls  to  llie  piimilives): 

1.  Derive  the  vector  ol  points  in  .lu  B  sorted  tilong  tin.'  rruection  ol  the  split  line,  lo 
get  this  vector,  w  e  need  only  keep  I  he  approjniate  rank- vector  w  hen  we  const  met  t  he 
4-1)  1 1  ce  remem  her  t  h.i  t  when  building  a  /.I)  tree  we  had  I  In'  sort  ed  order  for  all 


V'  "Si’  V 


(linie!isioii>  foi'  ;ill 


L*.  Dftctiiiiiic  liy  t.ikiiyu,  t|ic  miiiiiiiiJiii  of /'..i  ha.  ))isl liljutc  to  all  points 

in  the  s(  ill  I'd  vrcloi'  of  .1  U  U. 

d.  I’atk  c'K'iiu'Hts  whii  li  ail'  witliiii  of  tlie  iiiorso  line  nsiiig  tlu'  pack  o])('ration  into 

a  now  sort  i'll  vt'cl  i  it  .1  W . 

1.  Sliifl  t  hi.-'  vi'c'tot  li)  1  lie  lielit  and  calculate  the  distance  from  each  point  to  its  neighbor. 
Repeat  this  six  times  to  eel  the  six  iieiehhofs  on  each  siile. 

'i.  DeU'tmine  b\'  lakin,”  th<‘  minimum  distance  found  in  the  ])revjous  step  using  a 

min-reduce.  l  ake  the  minimum  of  imm  a'ld  ^AB'  to  get  6^s- 

The  iilgorithm  will  run  in  O(lgn)  lime  because  the  ^--D  splitting  runs  in  O(lgtt)  time 
and  there  are  (J(lg  ii]  im'ige  sti'ps  which,  as  shown,  each  step  re()uires  constant  time. 

If  t  lie  t'Jf/-  fg  III )  space  re(|uired  to  .>,1  ore  the  milk- vectors  during  t he  2-D  tree  is  a  jiroblem. 
VO'  (  an  derive  I  lie  Mil  led  vc'clor  for  .1  U  II  on  t he  lly  by  merging  the  sorted  vectors  of  ,1  and 
II.  ’I  his  iiieigi'  reiptiri's  flflglgn)  time  [10]  and  will  therefore  incretise  the  running  time 
of  the  whole  algorithm  to  (RIgnIgIgn).  In  the  conclusion  we  mention  that  it  might  he 
reasonable  to  considi-r  the  imuge  ojieraliou  as  a  unit-time  [irimitive  of  a  vector  model.  II 
\ve  i  111  dud  1'  a  mei  ue  pi  i  mil  it  e.  the  algoi  it  h  111  will  run  in  0(  Ig  it )  ciills  to  the  primit  ivt's  with 
()[  II )  space. 

6  Line  Drawing 

Two  dimeiisional  line  drawing  is  t  he  prohh'in  of;  given  a  pair  of  points  on  a  two  dimensional 
grid  (the  1  wu  end  point  s  ol’ a  linei.  di'termine  wh;it  pixels  in  a  liiiitr'  resolution  grid  lie  on 
a  line  between  the  endpoints.  I.ini'  drawing  is  used  extensively  in  jiractice  in  generating 
liiiiipiiii  r  images.  rsj)eciall\’  in  comjmtei  aideil  design.  In  this  section  we  descrilie  ;t  very 
'iiiiple  line  di'.iwing  i(  ml  i  He.  It  gi'iierali  '  llu'  .-..iiiie  set  of  jiixels  as  does  till'  sim])le  (ligittd 
d  1 1  |e  I  eiii  i,i  I  a  iia  !y /e|  i  I )  1 )  A  )  si'i  ia  1  I  ei  hnii|  ne  [  1 '.  i  .  I  he  rout  me  i  a  kes  a  small  const  tint  n  iimher 
o'  I  ,d!'  to  I  he  piimii  i\  i's  on  '.eci.iis  .it  most  a-  long  as  the  nnmlier  ol  iiixels  in  the  output. 
H'l.iiise  111  the  segiiieiit  leiiiiiia  tSeciioii  J.  I  i.  the  iduiitie  can  be  usi'd  to  draw  many 
IllM’.,  Ill  p.il.il  I  1,  I  lie  liilllilie  Wi'  de-,1  till'’  ll.l-  liei'll  evlelub'd  li\'  S.ih'lll  fd-)]  to  ll'lldi’l'  solid 

o!  I  jl  ’(■  t  s  . 

1  he  basil  id'  .1  ol  I  In'  I  oil  I  i  lie  i ,  to  ( .ilc  iilal  e  I  lie  n  M  111  her  ol  pi  xels  ill  ;i  line  and  .illocal  e 
.1  -i'l  ol  rei'i.;-  I  rl  ih.il  li'iiglh  wllh  lie'  line  inlormatioii  distributi'd  acio.ss  lie'  wetols. 


longtli  =  line-length(/)i ,  =  •') 

A  =  increment(/^i ,  /)>.  lonf;(li)  =  (J,  .2) 
pixels  =  lenglli  +  1  =  (i 


index(  ])ix('ls )  =  [  0  1  2  2 

distribute(7^i ,  pixels )  =  [(1.1)  (1.1)  (1.1)  (1.1)  (1 

distribute!  A.  ijixcis)  =  |(1..2)  (1..2)  (1..2)  (1..2)  (1. 

flnal-position(/;i,  A.  index)  -  [(1.  1)  (2.  1)  (3,  1)  (1.1)  (') 


Figure  An  example  of  parallel  line  drawing. 


.  1)  (1.1)] 
,.•2)  (I..2)] 

.  1)  ((i.  D] 


1  li(Mi  !);i.s(j(l  oil  t  lie  lino  i  ii  iui'inat  ion  ami  a  iiniiiim  imiox  for  oac  li  olnmiMit .  t  im  cii'i  m'li  i  s  (an 
t'aknlali'  tlicir  linal  [Ki.^ilion  on  llm  ,n,i'i(l.  I'ii^nri'  o  illimiiaics  an  ('X.iiiiplo. 

1  in'  ('ode  111 '( '( I '  I !  I  o  ( ! :  a  VO  a  li  m '  i-i: 

define  line-length' .  /o'i{maximuni((/jj.x  /'i -X ).  ( /M-v)'} 

define  innementi //| .  k'ligilii'l 
•"v  '  /’j-''  /  long!  li; 

y  -  -  ( /o  .y  yn  .y )  /  longt  li} 

define  final-position{/Ji .  A.  iml('x){ 

\  —  1)1. X  +  roiin(l( iiuk'x  x  A.xl: 
y  —  pi  .y  +  rouiul(inclcx  x  A.y)} 

define  line-draw{ /)| .  /)_>)( 

li'iiglli  —  line-lengtli( /)| .  /o): 

A  —  incrementi /»! ,  /y..  Irn';lli); 
plxi'l-'  —  li'ngi  li  -r  I ; 

final-position{distribute( /'I .  |ii\cl.--).  distribute!  A.  i>ixc!>).  index( pixels ) )) 

'fill'  line-length  loniim'  cah  ulaio.-  llic  Iciiyili  of  i  In'  liin'.  flic  increment  rontino  talcnlalos 
till'  X  and  >■  iiii  ii'iin  nis  ln'iwcon  adj.ici  ni  pixels  in  the  line,  flie  final-position  routine 
la!  dilates  the  pixel  po.^it  ion  of  .a  point  given  one  enil|)oinl  of  the  line,  the  x  and  .v  i  note  men  is 
o|  the  line,  and  tin'  position  (index)  ahung  the  line. 

1  lie  line-draw  routine  iiscs  the  distribute  operation  to  (list  rihute  /p  and  the  inci'emeiit 
(A)  n\ei'  I  1;  iie-lengt  h  r  1)  eh'iin'iil  s.  and  Uses  the  index  r)|)eialion  to  generale  ,'i  set  of 
((in-erutive  iniee,.|s,  j,  ,.,i(di  elements.  We  iieed  (line-leiieih  f  1)  elements  hecaiise  \ve 
'.’.ant  to  1  ill  1  lid  e  hot  ii  e  ii  d  pol  11 1  s. 

7  Lino  of 

( 1 , I  'll  .111  y  II  Ii  V  y  II  u  rid  o|  a  1 1  i  I II  lies  ami  an  ohservat  ion  poi  n  t  on  or  a  liov  c  the  mi  rhn  m. 
.1  hin'  o|  s|.j|,(  .ilM'iillim  linds  all  points  on  the  grid  visilde  from  the  ol  iserv  ,i  t  ion  point. 
I  ig'iie  I,  sIiijv.-,  nil  '"-i.imple,  .\  line  o|  si-ght  al.gorithm  can  he  apjilied  to  help  determine 
'.vh'ie  III  loi.iie  potential  eyesores.  1  Of  exti  III  jile.  \\  In'll  designing  a  linilding.  a  highway  oi 
a  I  ii\  . . .  It  is  olien  i  II  lorma  I  ive  to  know  lioiii  where  the  "'eyesore  will  he  visih'e. 

I'l 


lilt'  hill  in  liii,^  rctniircs  Oil)  call.-.  lo  ilic  pri  iiiit  i\ i'-,  ii-iii<_; 

Uini.  1  111'  lia-ic  iilra  is  to  allfx'alo  a  M',i;ini‘Ut  in  vnctoi  lo.-  \  r,iy 

liial  piopayaii's  in  tin-  pi.iiie  lioni  I  lio  oUsolvalion  [)uinl.  lioncelui  t  li  ii'leiicil  to  a-i  A’,  ton 
lioiiinlai)  imsiiioii  isci'  1  n^nii'  i  i.  Ma.si'cl  on  .sonio  {'alcnla t ions  on  tln'  points  in  nai  h  \vn 
'  :i  ih'ti'niiini'  il  i  ho  point  is  visitilc. 

1  111'  a  1'j.oi  i I  11  11 ,  con-i'is  (it  tlu'i'i'  liasic  .stcjis. 


1  l-.icli  jioiiit  j)  ill  till'  mid  (alinlati's  tlu'  vertical  ansle  lietucen  the  liorizont  al  plane 
iliat  p,i-ses  itiidne'i  A  ( I  lie  ol),ser\‘al  ion  point)  and  the  line  IVoni  j>  to  A’,  1  Ids  is 

evec  lit  ed  liy  d  i  s!  i  i  1 1  a  1 1  n  y  the  |ocat  ion  of  A  over  all  point  s  and  i  ainda  t  i  iiy  the  an  I  (in 
ol  the  liori/ontal  ddiereni'e  over  liie  vertical  diU'erence. 

-■  1 nlyoiiihiii'  allocates  a  set  ol  rays  one  for  each  hound. ii>  yrld  iniint  and 
(1  Isi  1  i  iiin  i-.s  the  a  leyh's  1 1  oni  each  |iolnt  pin  t  h«'  y  rid  t  o  all  the  i  ,i  -  i  i  I  leh  .n  ys  i , ,,  1  ,,ic  h 
ac'  Is  a  -eymcai  in  a  Mctor  we  will  call  the  rm/  liii<  . 

■  1.  loh'ov.iiiy  a  ray  Iroin  A  lo  ihe  Innindary.  a  point  p  is  visible  if  its  atiyle  is  yieaiei 
’lam  all  the  anyles  that  precede  it  in  the  ray.  J  his  lan  he  detenniiied  foi-  all  points 
ill  all  rays  with  a  single  seyiin'iited  max-scan,  and  a  coniitarison. 

1.  \  isihiliiy  iiilorniat  ion  is  returned  b;ick  to  tiie  grid  points.  Since  tt  grid  point  cm  have 
a  position  in  many  rays,  the  visibility  (lags  are  coinbiiu'd  using  or. 
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Since  steps  I  and  d  should  be  clear,  and  step  4  i.s  basically  the  revr'ise  of  step  '2.  we 
oiii\  desiiilie  sK'p  2.  lo  itlloctile  the  /Y/i/  slructun  the  algorithiii  draws  ;i  line  fuiiii  the 
oh'i  ivaiioii  point  toeai  h  boundary  element  using  the  routine  discussed  in  Se,  tiou  h.  linch 
|ioi;iI  iniylil  helony  I  o  se\'i'r.i  1  ol  these  rays  (points  netll  A"  will  heloiiy  to  Iiiole  rays 

nr, 11  the  edye-i.  lo  d  i,..!  ri  hn  I  <'  the  angle  from  a  grid  point  to  ,,11  tin  it 

heony-  to.  I  he  ;i  lyol  it  h  111  create-,  a  not  Imr  -ey  men  ted  Vcf  I  or  si  flic  t  n  i  e  1  In'  ee/e,i  s/  Cie  /  (O  f  . 

In  I  a  e  1  opy  -Inn  I  ii  i  r  : •  ,,  ly.u  it  I,  n ,  idiocn  t  e-  .i  seynien  t  lor  enc  h  yi  n!  poi  n  i  //.  1  he  -  i/e 

"i  lae  -eymrJil  j,,,-  p,„nl  p  i-  rijll.ll  lo  tin'  lllllllher  o|'  laVs  /,(  helony-  lo  I  hi-  (.ail  he 
'  I'  ’ '  n  n  li  I  ted  I  run  1  tin’  :  c  l.a  i  i  '/r  pi  i-il  ion-  i  .1  /n  A  n  nd  the  lion  nda  iw  Mac  li  pi  >1  n  t  p  d  i-l  I'i  hn  I  e- 
ii-  aiiyle  to  it-  -eynniii  in  I  lie  copy  -t  rucl  lire  using  tin'  distribute  operation. 

1  here  i-  i,(  lU'  a  1  t  o-  1  in.i  pjiing  h.i  v.  een  pi  i- it  ions  in  the  copy  st  rin  t  u  i  e  and  po- it  ions  in 
I  he  nlyorithm  can  (.ih  nlnle  the  permillnlion  illdice-  neeiled  to  exei  nle 
I  on  the  locnii.iii  ol  ,\  .  ()nce  the  angles  Intve  hern  iieimuted  to  the 
ih.oiiihiii  ex.'cntr-  -irp  lo  return  the  inroi mnilon  hark  to  the  gild 


.> 


it  leclutes  using  au  or-reduce.  At.  roinpli-tion,  all  jjoiiits  visible  fi'tim  any  ray  an'  marked 
and  returned. 

Idle  Uuigest  vectcji's  re(|ulred  by  llie  algoritliiii  will  be  the  veclnis  of  ibe  (ups'  ami  r,i\ 
St  nicl  111  e.s.  It  is  not  hard  to  sliow  that  for  a  ^/n  by  grid,  indejiendeni  of  tin'  loi  alion  of 

A’.  tlic.-'O  vectors  will  ha\’e  length  2n. 

8  Convex  Hull 

I  In'  planar  con\ex  hull  problem  i-;  given  u  [loint-s  in  tin'  phiiie.  lind  uliii  h  ol  ihi  .^e  poinl.- 
lie  on  I  he  iierimeter  ot  tin.'  einalle.si  ('(.nivr'.x  region  that  coiii.iins  all  point.^.  1  he  pl.iimr 
convex  hull  jiroblein  i.s  probably  1  he  most  studied  problem  in  ((.iiijmtatioii.il  geonu'irv. 
both  because  it  is  a  sim]ile  probh'tn.  making  it  «>asy  to  stuiiy.  and  because  it  lias  maiiv 
aitplictiiious  -  tipplications  rtuige  from  c(jmpnter  grai»hics  [11]  to  statistics  [17j. 

In  this  section  we  describe  three  scan-tnodel  ba.secl  algorithms  for  determining  the  convex 
hull  of  a  s('t  (.)f  ptjints.  The  first  two.  a  parallel  Qiiickhall  [22]  algorithm  and  a  parallel 
•Jarvis  iiituch  algorithm  [Is. 2],  tire  very  sitnple  atid  likely  to  ]>eiform  wi'll  in  practice  but 
iire  not  piovably  (.iptinial.  The  ihiial  algorithms  is  more  com])liciit ed  twid  iiiipr;ict ical  but 
is  t heoi'ei ically  optimal.  1  he  tdgoiithm  i.s  bttsed  on  a  partillel  algorithm  de-igned  for  the 
concurrent  read  exclusive  write  f('Ul',\V)  I*d{.\M  mr>dei  [l.l]. 

8.1  QuickHull 

lids  is  ;i  ptirallel  version  id' t  In' (Ttit'kllnll  idgorit  hm  [22j.  1  he  nick  1 1  nil  algorit  h  m  wa^ 
giw'ii  its  n.uiK'  iiec.uise  of  its  simihirity  with  the  (|nick.'ort  idgoriiliiii.  hike  (piicksort,  the 
t[uickhu!l  rilgorithm  [ticks  a  jiivttt  ehmient.  a  jtoint;  sjtlits  the  data  bast'd  on  the  [tivot:  .uni 
is  then  lecuisivekv  applii'd  to  e.ich  of  t  he  split  si'ts.  .\lso  lik<' (piii  ksort ,  t  in'  pi  vot  ('leiiK'iit  is 
not  glial  a  11  teed  to  split  t  lu'  data  into  set  s  with  any  particiilai  rat  io  of  si/es.  so  that  in  tin' 
v'.dl.-t  case,  the  algorithm  c.ili  letiuiic  ii  steps, 

figure  s  sho'.vs  iiii  ex.iiiiple  ot  t  he  ( j  ii  it  kh  M II  algtttiihm.  The  aigt  nit  Inn  lli't  splits  the 
pi'. Ills  in;, I  I I )  v-.iil;  a  line  tha'  p.is'i's  b.'lwi'i'ii  I  hi'  two  c  i'\lien.,i  lets  tall  1  In.e 

pt  || !  1 1  '  /  a  In  1  r  .  In  '  ki  -i  .i  li  lin 'd.  i  '  h  i'  |s  exi-i  u  t  t'd  wi  t  h  <i  h'W  1 1 'i  1  in  e  ;;  in  I  1 1  is  t  i  i  b  ii  I  e  i  ipe|  ,i 
lion-.  'I'lin  '  n  i:,e|iiv,,,-c  .iii'limeilt  i  .di  nialioiis.  ainl  a  split  opt  i.tlmn, 

1  a '  .  1 .  j '  c  1 1 1 . 1 : .  1, 1 1 . .  I  ft  i|  I  s :  \  1  '1  \  s  pi  it  s  I'iii  li  ol  the  t  '.V 1 1  s  u  bs  pat  u  .  into  1  c,  1 1  ii  s  1 1 ,  tin 

l'’in'  _■  '''P-  It  i'  t'  illl!'.''-  I"l  e.n  ii  jliiinl  /I  in  till'  sulispat  1'  the  pe|  pemlj,  ll  l.l  l  tiislaint' 

I I  ol  n  'ill  1 1 '  '1 1 1 1  1 1 1  I  . ,  e  1 1  In  .’  I  ,  1  a  1 '  I  .1  n  In'  i  ut  It  ii  l.i  t  eil  w  it  h  a  c  I'oss  p  i .  n  1  nc  t  I  )l  I  In '  1  i  Ill's  /  /  ii  ml 
I  j  I-  1  In  '  .1  !  'goll  1  ll  !  I  I  semi  '  .  !  ke  l.l  I  I  hf'l  [toi  lit  I  O  MU  I  In'  |  j  In'  /  c  ,i  ml  d  bl  11  ll  U  t  I's  it  t  o  a  1  I  ot  ill'l 


;a  b  c'  1)  e  f  g  h  I  j  k  l  m  n  o  pi 

A  ;b  d  f  g  h  j  k  m  o;  p  [c  e  I  l  n: 

A  ;b  f;  j  ;o,  p  n  [c  e] 

A  B  J  O  1’  N  C 

1  An  example  of  the  QuickHull  algorithm.  Each  vector  shows  one  step  of  the  algorithm. 

Since  .1  and  /’  are  the  two  .;  extrema,  the  line  .1/’  is  the  original  split  line.  J  and  are  the 

farthest  points  in  each  subspace  from  1/’  and  are  therefore  used  for  the  next  level  of  splits 

The  values  outside  the  brackets  are  hull  points  that  have  already  been  found. 


<’l('  iu<“ii t  ^  i  !i  t  h<‘  Ml  l)^’.  i;\<  li‘!  >  <'al!  t  !ii>  |>t  >iu t  /.  ll  should  l>a  c  h‘i\i'  Uiai  /  Ik  >  nu  1  In  ‘  (  on  x 

liul!.  I’liiiii.'  wliliiii  ilii’  hr  liiiiiii)!  In’  (III  llic  t'diivi'X  hull  iiiid  iiic  (■liniiii.iii'il  wiih 

a  pack  i)[)i  latidii.  1  In-  iiuiiil  /  is  iiuw  u.scil  In  lul  l  her  .sjilil  each  scp,ui(.'Ut  hascil  nu  which  ol 
the  t\'.()  sides  (if  lh(‘  triaiipli'.  ll  nr  rt.  they  fall,  I  he  alK''i'ithiil  is  now  apjilied  li)  the  lie\'. 
s(‘eiii.  ats  i.'i  ursivei\'.  1  he  ahjiiiithm  is  foiiipleieil  wlu'ii  all  seaineuls  are  eiii|)ty. 

1.11  h  step  1  ( ■([  u  i  1  cs  a  siiial!  ((ills  I  a  Ml  iiuiiiher  ( >1  ca  lls  to  the  priiiiil  i  ves.  .\s  with  the  s('i  ia  1 
CXiickllulI.  lur  III  hull  pdiiits,  the  .ileorithni  runs  in  (,)(ln;n/j  stops  frir  well  distributed  hull 
pdini'.  and  has  a  wiU't  (  ase  luiiiiinp,  tiiiii;  of  (fit")  steps. 

8.2  Jarvis  Marclt 

1  his  is  a  parallel  \(.'rsioii  nf  tin,'  .Jiirvis  man  h  alttnrit  hin.  .As  with  the  seiial  xersidii.  it  will 
t'.'di  K  c.i'll  when  iliena  an'  only  <i  levs  pi.iints  on  the  hull,  such  as  when  the  (  (Uive.x  hull  is  a 
simple  pdlvedii,  1  he  a  1 ‘j,i  U'i  1 1 1 1 1 1  starts  at  an  e.xtieiiiun  pdini  i  aiul  liiuls  the  point  ii  that 
t'.i.d'.e-  the  UeiximU!'.',  pdlav  at. V. 'Pli  (  11  is  til''  fe'xt  point  on  the  hnih  1  ho  'alaoiiihiu 

then  !i  !id-  the  ma  xi’uuill  (id'a  !'  all '-'!<■  to  t  hl.s  pdiiit  .  I  he  step  lepe.i  I  s  a  roll  lid  I  iie  ii  nil  U  ll  1  il 
S',  I  I  et  i;  I  ll  Id  the  dl  iei  lla  1  [Idi  n  t  ,  I  d  h  lid  e.udl  hull  pdi  III  We  Iieeil  a  |e  VS  arithmetic  ope  la  I  mils 
,1  la  I  a  'i  I,  c!e  ma X- r eci iice, 

1  di'  III  liull  points,  this  alaoriihiii  reiiuires  (A'")  ste|is.  hut  each  step  is  so  simple  that 
in  some  cases  the  al'Aoi  it  ii  III  is  f.isier  than  the  other  ahj,drit  h  im  men  I  ioiu'd . 

8.3  \  /(  Merge  Hull 

This  alaoiithm  is  a  v.niaiidii  ol  a  par.ilh'l  alaoriil.iii  siiaia^csi.'d  in  Tl  and  i  ndepeiideni  iy 
ill  1  .  1  loir  a  l'a,di  it  h  m  is  ha.- ( d  on  the  i  one  ii  i  rent  retid .  e,X(  liisi  v  c  v.ri  i  e  ( (  '  li  I  i  \\’ )  1’  -  li  .A  M 

illddi;.  W'e  (-.il.lidt  ll-e  iheil  .lladrilhlll  d  i  I'ei  1  1 V  llecailsO  llu'  s('a  1 1  -  1  i  Idd  e  1  dot's  Iiol  pel'lllit 

((.inciii;i  n'  ata  mss  to  a  simple  v.due.  a  in  cessarv  part  of  their  .ihpoi  it  him  Ihe  variation  W( 

(led  i'!, IS  keejis  ,,|1  eleme|;ls  if.it  let  |  U  i  I'e  the  'ailie  (hit. I  ill  .1  CdllliaUdlls  seaiiieiit  SO  the 
I..lt,i  t.ili  i.e  li  ;  .  :  ,  1  ,iiO  .|i  ll-ine  ,|  d  I  .S  t  111)  11  !  P  d  p  e  1  ll  t  id  11  .  1  h  e  (  d  1 1  I  I  I  i  I  1 1  I  id  11  dl  dill  Ve|sid||  js 

'  lid  .  ..  e d ,,  ;  . . a.  i, ;  1 1  ii '  i .  ,id  dj a-i  .1 1  idu  c.i n  he  1  epl.n  ■  d  h V  t  he  distt  ibu te  opera  t  idii  a  lid 

I  a.  1  a  , .  ■  a  'lee  s,  .,1 : .  ij.ei  la  h  ;  1  i  is,  n  .s.-i  1  m  the  next  seel  i<  in,  1  1  he  1  he  ( u  i'a,ni.d  a  InoiT  h  m . 


I  la  ■,  ,1  :  ..1  '  idli  We  d  t  "  idle  rails  -(V  il  h  i  >(  fa  //  i  ca  Ns  to  1  iie  pri  m  1  I  1  ves  .  \  \  e  1  ie,e  i  n  h,V  1  eviev,  i  im 

tie'  (  I ;  I  W  .1 1  v  u  1 1  h  :  1 ' . 

I  ,  e  ,d  eii;  1 1  il  II I  soil'  t  h‘  p(  -ml  '  ,11  Cd|  d  i  ni;  to  1  hei  r  .r  Codl  d  i  na  1  e.  it  s]  li  ,  .s  l  Ii  is  or  de>  i  ;a_; 

i  nt  d  \  ll  ei  I  il  a  1  -  i  /  ed  si ’t  s  (  i|  pi  i|  nl  s  a  In!  re(  II  Isivcly  solves  t  he  ((  III  vex  hull  IdC  e.l  I  ll  set  ,  1 1  I  lien 


-*  ■-  .-■(•-»  - if- 11 -.(1.. 


1  i.u,ur('  9:  An  example  of  the  ^/Ti  merge  hull  algorithm.  The  horizontal  dashed  lines  .show  the 
division  of  the  points  into  groups  of  v'T'  elements  each.  The  subhulls  within  each  group  are 
marked  with  solid  lines.  The  upper  chain  is  the  chain  A  B  J  0  P. 

uic'j-yo.s  the  ^777  subhulls  (see  Tii;uic  9).  The  .sort  and  the  merge  both  take  O(lgn)  time’°. 
The  ruuning  time  of  the  algorithm  thus  lias  the  recurrence  relation  l'{n)  =  T{\/n)  +  /rlgti 
wliicli  yields  77(lg«)  time. 

Since  the  elements  can  be  sorted  using  e.xisting  algorithms  we  will  concentrate  on  the 
merging  step.  The  merge  is  executed  in  two  irarts:  one  finds  the  u[)per  chain  of  the  convex 
hull  and  another  finds  the  lower  chain.  I'he  ujiirer  chain  is  the  section  of  the  convex  hull 
that  runs  across  the  top  between  the  two  .r  mtixiimi.  In  the  CRllW  algorithm  the  merge 
of  each  chain  works  as  follows, 

rite  algorithm  tis.^ign.-^  tin  elenn'iit  (a  proce>^or)  foi  each  pair  of  stibhnlls.  Since  there 
ate  y!  II  .>nbliulls.  (Jill}  el''nlenl^  a  1  e  .-n  llicieii  I .  Tacli  ol  llie.-,e  pairs  linh'peiidently  hnds  the 
upper  tangent  liiu'  meni ' '  bctvo-eii  it'  t  wi  i  siibli  nl  Is  ii'ing  .i  serial  mi'thod  ol  Overm.ii's 
idl).  1  his  met  laid  e.si'c  ii  i  e.,  a  binai’V  search  alternating  hetwe<'n  the  two  subhulls,  and 
leipiiies  (d(  Ig //  )  liiiie.  ,\ I  the/.'  s  1  ep  ol  I  h  < '  hi  Iia  r  V  sea  ic  h  .  Ji  U  ele  men  t  will  eit  her  go  down 
the  h'lt  brain  h.  the  tight  blanch  lU'  will  stay  still. 

Once  ihe  ii[)per  laiigeiii  line'  havi'  been  loiind.  the  tdgoritlim  determines  the  bri<lges 

1  !i''  .1  I  !i  III  oi  (  'ol*'  [  1  » ,ni  1>*  n  -•  d  loi  .'ndi  I  iii-;  in  t  In*  ( 'll  I  .  \\  iin*'  1«1. 

‘An  11  PI X  I  t .1 11  i  li I  oi  t  wo  of  p«jiiits  i>  tin*  liiu-  t ha f  passes  f  h rr/n^h  at  least  one  point 
f  1  "111  '  .o  fi  ' '  (  "<1  lh.it  .ill  nf  fc  I  p  - .( fi  I  -s  I  M  t  In  t  \\  - 1  f .If,-  |>r  h  »u'  tie-  line. 


the  \/^  miIjIiuIN.  1  li<'  ;in'  I  lie  ii|)|ii'i  I.iii^ciil  1 1  ii  i'-m '^^iiicii  I  lli.il  l)|•l^)llL;  In 

the  uijper  ( liaiii.  lo  liml  which  of  the  uj)[)ei  l.iiij^eiit  lines  ;ii<'  hridj^es.  e.u  h  suliliull  liinls 
the  hi;i,he,st  sloped  line  in  both  dire<'tions  (to  ;i  point  on  llie  lieht  and  to  a  |)oinI  on  the 
left).  If  the  joint  foriued  by  these  lines  is  convex,  then  both  lines  are  bridges.  If  the  joint 
formed  by  the  lines  is  concave,  neither  arc  bridges.  .-\ll  edges  on  a  .snbhull  that  lie  Ijetween 
bridges  of  tliat  siibhull  also  belong  to  the  convex  hull. 

This  algorithm  cannot  be  imidemented  dio'clly  on  the  scan-model  sime  each  pair  (d 
subhulls  independently  finds  the  upper  tangent-line  segimmls  using  the  algorithm  of  Ovm  - 
mars,  and  will  therefore  rcfiuire  concurrent  reads:  several  pairs,  while  executing  the  bin.iiy 
search,  will  rer[uire  access  to  the  same  elements.  To  avoid  the  concurrent  read,  we  jdace 
each  of  the  sets  of  \/n  points  that  Ixdong  to  tin'  same  subhull  in  its  own  segment.  We  then 
use  a  general  binary  search  method  ib'scribed  in  the  next  section  to  exeiute  tin’  binaiy 
search.  This  search  will  recpiire  O(lgii)  time. 

Our  variation  of  the  CR  ti  W’  a  Igor  it  hm  runs  wit  h  t  he  satin’  number  ol  calls  to  i  lie  pi  im- 
itives  ;ts  the  originrd  since,  as  with  the  original,  the  sort  runs  in  (>*ilgn)  lime,  and.  as 
shown  tibove.  th(’  merge  also  runs  in  fRlgn)  lime.  In  a  sense,  this  variation  liades  the  the 
concurient  letid  captiliility  kir  the, scan  ca|)abihly. 

9  Biliary  Search 

In  thi.s  section  we  consiih’r  the  problem  of  ii  elements  of  a  sc’t  .1  each  executing  a  binary 
search  on  a  binaiy  tif  i'  /  willi  m  vertices.  W'e  assume  that  the  i  ri’e  7'  is  organized  in 
a  vector  using  the  standard  leap  ordeiing:  the  root  vtdue  is  stoied  ;it  7jl]  and  tin'  two 
chihlren  of  a  vortex  stored  at  7’(i]  are  stored  at  7'['Ji]  and  7  [di  +  ij.  With  a  cuiiruri'ciit- 
ridil  ]irimitiv(’.  a  binary  search  is  simple:  each  eh’inent  of  .1  stalls  bv  reading  the  root  of 
7  .  lie.  ides  whicli  way  to  go.  and  lollows  a  jiath  down  to  the  heaves  based  on  a  test  and 
some  simple  aril  linn’tic  at  eai  h  vi’i'li’x.  .Sm  h  a  si’.irc  h  requires  concurrent  access  by  man\ 
I’lemenis  ol  .'1  to  a  sirngle  element  ol  I  . 

do  execute  the  binary  si’aich  Using  the  sian  piimitiv<’s  instead  of  a  (onciirrent  read 
primitive  vv  c  can  use  a  met  hod  b.ised  on  rec  u  i '  iv  e  split  1 1  ng.  We  si  ,i  1 1  with  all  the  eh'ment  s 
ol  .1  in  a  single  segment  <ind  then  split  that  seeuient  ba'-ed  on  vvheiher  an  element  is  going 
to  the  right  or  to  the  h'lt  child  ol  tie-  tool  ol  I  .  We  ilii-n  os  ui'ively  split  within  I’ai  h  ol 
t  liCse  segUK'lit  S.  based  oil  d.i  t  a  I  rom  the  next  lewd  ol  the  t  I  ee,  si  la  e  all  the  elemeii  I  S  ol  ,  I 
that  ai(’  acci’ssing  the  saiiie  viutetv  o|  /  will  bi’  in  a  loniieuoiis  seena  iii.  we  can  use  tin- 
distribute  o]H’ration  to  di-t  ribute  i  he  value  liom  eaih  veite\  of  i  la  t  iee  to  the  ehuneuls  that 

di. 


Ill'Cll  il. 

W'c  now  considci-  a  ,u,ciu'ializali()ii  on  llie  .simi)l('  liinary  search.  .Vt  each  ste]).  :us  well  a.s 
allouiue  an  element  of  .1  to  go  to  the  left  or  right  child  of  the  vertex  of  7’  it  is  currently  at, 
we  allow  it  to  remain  at  the  same  vertex  of /'.  'I'his  means  tliat  thej’e  might  he  elemejjis  of 
.1  at  all  levi'ls  of  the  tree  instead  of  at  a  single  level.  We  also  allow  new  elements  to  enter 
the  seaia  h  tree  at  each  ste]).  I'igiire  10  illustrates  how  we  store  the  elements  of  A  and  an 
eAani|)h'  ol  a  step  of  tlie  generalized  binary  search. 

lo  exec  ute  a  Atep  of  tin'  binary  seiirch,  we  must  somehow  ajrpeud  the  elements  at  a 
\erlex  r  that  remain,  with  tin*  ('h'liu'iits  bc'ing  pas.sed  down  from  the  jearent  of  c.  'I'o 
.ippeinl  the'  elements,  wc'  can  use  the  append  operation  discussed  in  Section  d.  I'he  Ijasic 
idea  is  first  to  separ.ite  the  elements  that  remain  from  those  that  go  to  a  child  into  two 
sepiiiate  vectors  using  two  pack  operations.  For  the  example  of  Figure  10  this  wamld  return: 

remain  =  [«o]  []  [«.i]  0  ["5  «g]  []  [ny] 

uot-rc'iuain  =  [c/,]  [n^]  [«a]  []  []  []  [] 

W’e  till'll  splii  the  oni'A  going  to  a  child  based  on  whether  they  are  going  to  the  h'li  or 
li'zlii  cliihl  Using  a  split  opc'iatiun.  riiis  wcnihl  rc'turn; 

sphtnot-rc'inain  =  (]  [«,]  [«,]  []  [«,]  []  []  []  [1  0  [I  [':  11  [1 

We  now  shill  the  segments  of  the  split  vector  right  by  one  and  insert  tin'  new  elements 
in  I  lie  left.  Ilecaiise  of  the  heap  order  of  T.  this  will  c;\use  eac  h  sc'gment  lo  go  to  its  child 
segment.  We  also  truncate  the  sc'gmenis  that  corre.spcjiid  lo  children  of  the  leaf  vertices. 
I'hese  calculations  would  return; 

children  =  [c/s  c/a]  []  [c/i]  [n»]  []  [u.a]  [] 

We  now  api)c  lid  the  shifted  vector  (c  hildri'n)  to  the  vector  of  eh'inents  that  rc'maiiied 
i  ii'inalii)  using  thc'  append  operation. 

I  he  following  roniinc'  can  !)e  used  to  exi'cniic'  a  step  ol  the  binary  M.-arcdi.  hhe  ri  iikiiiiY 
Hag  '  pi  I  i  lies  elei  I  leii  t.-,  lliat  slav.it  theeurreiil  vc'rlex.  ;inij  l  he  /////// H  Hag  spec  i  la's  eleineuts 
I'l.ii  go  III  till’  riglil  hrancli.  /tare  the  new  eleineniA  to  heiiiAeried  ai  root. 

define  search-stepf .  I .  /’.  II.  r<  n/d/n  r'.  i-li/hl  Y){ 
remain  —  packf.l.  ii  indiiiY): 
iiot-remain  —  packf.l.  notf /x///c////'r') ): 

i  hildreii  —  shift-segments-rightf //.  splitf  not-remain.  /■//////'/')); 
appendi  lemaiii.  c  hildren )} 
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A  (before)  =  [ao  «il  [rt':)  [e.T  e.t]  [)  ^'r,]  []  ['''7] 

F  =  [x  r]  [I]  [I  xl  fl  [x  xl  (]  [xl 

I!  =  «,,] 

,1  (after)  =  [<Js  a<,  0(,]  []  [e,  (/.,]  [<;,,)  far] 

1  10:  An  example  of  a  step  of  the  general  binary  search  technique.  We  keep  a  segment 

in  .1  for  each  vertex  of  the  tree  /'  such  that  segment  /  corresponds  to  vertex  r  Each  segment 
contains  all  elements  at  the  corresponding  vertex.  The  vector  I  iiulicates  an  example  of  where 
each  element  wants  to  go  during  a  step  of  the  search  (r  for  right.  I  for  left,  and  x  for  remain). 
The  vector  II  contains  new  elements  entering  at  the  root  of  the  search  at  that  step. 


i  T'. V  T.  >r.  w.  y,  K".  y,  v-.ttjv.  -f.  <rjv>.'^_  A  ■>  *  VXiT".  y'J»'«:i'V'^'^.-i. 


]  Ills  M'.iixh  ^lep  c;iii  1)0  um'<1  to  cxi'ciili-  tin-  l.iii;i:  i  I'.i:'  ;,'  (  ii' il  Ij\  ilii’  h  iiicil;,.’ 

liull  algorithm  ^ll^Cll.ssocl  in  S'-'Ctiun  I'oi  tin.  l.aii.  no  ■  '..•■in-ii t ~  i^nt  in-inU'il  ii,io 

t  lin  1  ixan  hut  sunii'  ('li'incnt  s  ilo  n'luaiii  at  tluur  cuiioni  voiiiv  ilm.nn  .1  o|  tin  hiii.os 

^(■a  rch. 

liiiia  I'V  nca  I'cli  ill  list  I'ti  tos  a  11  ini  port  a  nt  d  ilh'iiMicc  IhU  \>,  oiui  tin-  irriu  u  .1!  pi  o'^i  .1 1  iiiiii  a  li  ^t  ylu 
ui-c'tl  lor  roiinirroiit  i'('a(l  I’-HAM  modfl,^  and  lor  vector  inodel.-'.  In  t!ic  I’dIAM  model,  the 
inohloni  is  lu'sl  thought  of  as  11  independent  proce,.,.,e.s  e.n  1,  e\c,  niii.e  it,--  ouii  o-.iicii  on 
llic  tr('(' 7'.  In  iho  scan  model,  we  must  t  liink  of  t  lie  n  elc-ments  a>  a  set  and  hreaktiiat  si  i 
into  subsets  according  to  which  verte.x  of  /'  e.icli  eh'ment  i>  accex.-,ine .  I'his  mieht  jii>t  he 
a  idiilosophical  point,  hiil  we  believe  it  i,-.  im|>ortant. 

10  Conclusions 

1  his  paper  introduces  the  idea  of  a  vector  modc-l  of  comiiutation:  define>  a  particiihn  vector 
model,  t  he  scan-mtniel;  and  doscrilies  several  al.eorit  Inns  implemented  on  the  scan  model, 
Sinci.'  tiiany  of  tin.'  tileonthms  discussed  in  this  jiaper  are  variants  of  known  aleorithius. 
v.e  lielie\'e  that  much  of  the  coin  rihiilioii  c)l  this  paper  is  to  met  lioilology  lathr'i'  than  to 
algorithms.  I'he  code  we  show  in  this  paper  with  only  slight  syntactic  chaiiei's  has  hei'ii 
ii'-eil  to  implement  (he  algoridnns  ilescriln-d  on  ()je  (’onnectioii  .Machine. 

believe  that  the  algorithms  we  describe  are  very  jtraciical  for  implementation  on  a, 
wiile  r.iiige  of  architect  tires,  both  serial  and  ptirallel,  and  slionld  iti  most  ca.st's  be  almost  as 
fast  oil  a  partienhir  arcliilt'ct  ure  as  algorithm  designed  specilically  fur  that  tirchit ect  nre' ■ . 
1  his  eenenilily  is  one  of  the  niaia  ad vaiittiges  of  the  sean-mode)  over  the  J-’-ll.A.M  models. 
1  lie  ativanlage  arises  both  becaust'  the  scan-model  is  a  vector  model,  allowing  eflicit'iit  im¬ 
plement  a  I  ions  on  vector  processors  and  single  in  si  rue  I  ion  parallel  [irocessors,  ;i  nd  been  ust'  it 
! :  e.ii  s  t  he  .M'a  II  opei  at  ion  as  I  akiiig  no  more  t  inie  than  :»  permnt  at  ion.  a  1  ealist  ic  assumption 
|i  u  al mo- 1  all  a rc  hi lect  u  res, 

111  mole  recent  Work  we  liavc'  bei-n  con>ide|-i nc  the  effect  ol  incliidiiig  other  o|)('r:i I  ions 
11--  iiiiii  time  piimiiive-.  1  he  iiperaiioii  we  h.ive  1(11111(1  most  pionii^ing  is  a  variation  o\ 
lie'  iieuye  i  >]  u  ■  I  ,i  t  in  1 1 '  I  hi.--  opi  ration  can  he  implemeiili'd  ellicii'iillv  oil  a  v-ide  raiiyc  ol 

.1 1  chi  1  cc  1 11 1  e>  a  nil  i--  iisi'l  111  Ini  nia  n  V  algi  ml  hiiK.  I  o  i  mplemeni  t  In'  mere e  opt'fii  t  ion  on  serial 

,1 1  c  hi  I  I'c  I II I  e.-  recall  iiM’ t  he  >t  a  nda  rd  ini-i  'j,e  i  ipcral  ion.  a  iid  on  pa  ra  llel  a  rch  it  ect  n  res  we  ca  ii 

1  li  i>  I"  II'  >1  I  MIC  i'  ti  4  I «  111  I  'tI  Hits  unit  !•  >\v  1 1  •nil'-*  1 1  \  1 1  y  >n«  li  a>  K*  a  « v  In  t  ri  l  ii  i  -  s  or  t  n  «•  an  h  ii  it  I  u  irs. 

‘  <  1 1  s '.  II  t  \  <  t  III  >  .  \  a  ml  <  'f  n  ninl  ■■  I  .■>.  it  i  •  t  n  i  iis  a  vrclor  (  '  (tf  Iciyu,!  it  .1  with  imlic'-s  into  1  lif  vtA't  <  tr 

n  1  li'  iMili*  <'x  |imiiii  It)  \vli.  r<'  ill  }i  an  «  I'-iiu-nt  in  A 


? 


use  a  variation  ot  Hatclior's  Ijitonir  iiUMf^c  [7].  Alpsorit  Inns  to  const  i  iict  and  nianiindalc  t  in' 
plano-swc'ci)  tree  data  structure  [3,1,0,215]  can  be  greatly  siinpliRed  witli  a  unit-time  meiy;<' 
ojunalion.  We  liave  also  found  tlu‘  nierfi,'-  |)riinitive  iiselul  for  manipulating;  sets.  We  base 
also  considered  sorting  as  a  primitive,  but  we  find  it  hard  to  argue  that  sorting  should  be 
assumed  to  reciuire  the  same  time  as  a  permutation. 

We  hope  that  the  paper  will  helj)  spur  further  interest  in  designing  algorithms  for  vector 
models  of  computation. 
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